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METHOD AND APPARATUS FOR COMPUTING REED-SOLOMON ERROR 

MAGNITUDES 



CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] Not Applicable 

STATEMENT OF FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
[0002] Not Applicable 
BACKGROUND OF THE INVENTION 

1. TECHNICAL FIELD 
[0003] This invention relates in general to error correction coding and, more 
particularly, to an improved Reed-Soiomon decoder. 

2. DESCRIPTION OF THE RELATED ART 

10 [0004] Modem digital communication systems typically make use of error 
correction coding (ECC) schemes to improve system performance under 
imperfect conditions. For digital magnetic storage devices, for example, signals 
on the magnetic medium may degrade over time, or imperfections in the 
magnetic medium may cause recording errors. For optical storage devices, such 

15 as compact disks (CDs) and digital video devices (DVDs), physical wear and tear 
may result in misreading data. In both these cases, some errors can be correct 
through the use of a proper ECC scheme. 
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[0005] Perhaps more importantly, in digital communication systems, such as 
satellite communications, mobile phones, smart phones, mobile processing 
systems, and high-speed modems, errors can be introduced through the 
transmission medium. For clear, error-free transmissions, it is important to have 
5 a robust ECC scheme. 

[0006] Reed-Solomon error correcting codes are used throughout the 
electronics industry to provide error correction, largely because of their relative 
simplicity of impleraentation and their ability to detect and correct multiple 
errors, especially for channels subject to burst errors. Because, Reed-Solomon 
error correction is word-based, a single error encompasses any or all errors 
within a single word, as opposed to other correction techniques in which each bit 
error is an independent error event. 

[0007] Nonetheless, the increasing speeds of data transmission require 
increase speeds for error correction. Reed-Solomon error correction involves 
complicated matrix operations that must be performed in real time in most cases. 
In many devices, it is desirable to perform the Reed-Solomon error correction 
within a programmable processing device, such as a DSP (digital signal 
processor). Accordingly, techniques for improving the speed in which the matrix 
calculations can be performed are extremely valuable. 

20 [0008] Therefore, a need has arisen for providing an improved Reed-Solomon 
decoder. 
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BRIEF SUMMARY OF THE INVENTION 

[0009] In the present invention, for Reed-Solomon decoding, vector of v 
syndromes E/ and v error locations Ij are determined from a received codeword, 

and error magnitudes at the v error locations can be determined from the 
equation E^=^e^ a' , where a is a primitive of the codeword. Error 

magnitudes are determined by triangularizing a vxv Vandermonde matrix of the 
elements a' to generate elements of a matrix V, generating a syndrome vector 
W of syndromes Ei, adjusted for the triangularization of matrix V, and generating 
a solution to an equation of a form Vx M=W, where M is a vector of the error 

magnitudes ^/^ and Vx is a vector of matrix V, having a single unknown error 

magnitude. Substitution is used to create other equations of the form Vx M=W 
having a single unknown that can be solved for a respective error magnitude. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0010] For a more complete understanding of the present invention, and the 
advantages thereof, reference is now made to the following descriptions taken in 
conjunction with the accompanying drawings, in which: 

[0011] Figure 1 illustrates a general Reed-Solomon error correction system as 
known in the prior ait; 

[0012] Figtire 2 illustrates a block diagram of a Reed-Solomon decoder; 

[0013] Figure 3a illustrates a flow chart describing a preferred embodiment of 
triangularizing a root matrix; 

[0014] Figure 3b illustrates a table showing the results of the operation of 
Figure 3a for a 3x3 matrix; 

[0015] Figure 4a illustrates a flow chart describing a preferred embodiment of 

adjusting a syndrome vector pursuant to triangularizing the root matrix; 

[0016] Figure 4b illustrates a table showing the results of the operation of 
Figure 4a for a 3x3 matrix; and 

[0017] Figure 5 illustrates a block diagram showing the overall calculation of 
the error magnitudes. 
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DETAILED DESCRJLPTION OF THE INVENTION 

[0018] The present invention is best understood in relation to Figures 1 - 5 of 
the drawings, like numerals being used for like elements of the various 
drav^ings. 

5 [0019] Figure 1 illustrates a block diagram of a system 10 using Reed- 
Solomon error correcting codes. Data is input to a Reed-Solomon encoder 12. 
The encoder 12 receives data blocks having k symbols of s bits per symbol and 
adds 2t parity symbols to result in a codeword of n-k-^lt symbols. The 
maximum codeword length for a Reed-Solomon code is n'=2^-l. Thus, using 8-bit 

^ 10 (byte) symbols, the maximum code length is 255 bytes. The degree of error 
protection can be managed by allocating the number of data symbols (fc) and 
number of parity symbols (2t) within the codeword. A Reed-Solomon decoder 

^ can correct up to t symbols that contain errors. 

i 

§ [0020] Reed-Solomon codes are generated using a polynomial of the form: 

j^lj 15 g(x)=(X'a^)(x-a^)(x-a^),. . (x-a^^-^), where a is referred to as a ''primitive'', 

{ J and the codeword is generated as: 

P c(x)=g(x)'i(x), where i(x) is the data block. 

[0021] The encoded data, c(x), is transferred through, or to, a medium 14, 
such as a conununication channel or a storage medium. At this point, noise or 
20 other errors, e(x), may enter the encoded data stream. The data stream received 
at the Reed-Solomon decoder 16 is thus r(x)=c(x)+e(x). The Reed-Solomon 
decoder 16 decodes the data and corrects errors. 

[0022] Figure 2 illustrates a general block diagram of a Reed-Solomon 
decoder 16. These blocks could be implemented in hardware or in a 
25 programmable processor such as a DSP (digital signal processor). The first block 
is a syndrome calculator 20. The syndrome calculator 20 determines whether 
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there are any errors in the received code word r(x). Since the codeword is 
generated through multiplication with the generator polynomial, if the received 
code word is error free then its modulus with respect to the generator 
polynomial should evaluate to zero. If, on the other hand, the remainder is non- 
5 zero, there are errors in the received codeword. The remainder is called the 
syndrome. 

[0023] The syndrome calculator 20 calculates the 2t known coefficients of the 
Fourier transform E(x) from the received message. These are known as the 
"syndrome coefficients'' and denoted Si, 

10 [0024] The next block is the error locator polynomial 22. The error locator 
m polynomial has roots that give the error locations. The two most popular 

■J 'br 
■S55r. 

¥ I methods of finding the error locater polynomial are Euclid's Algorithm and the 

US 

*C Berlekamp-Massey Algorithm. 

m 

[0025] The next block is the error locations block 24. In this block, using the 

I - - 

pi 15 error location polynomial from block 22, the roots of the error location 
^ polynomial are determined, typically using a Chien search, 

[0026] The next block is the error magnitudes block 26. At this point, the 
location of the errors is known, but not the errors themselves. The syndromes 
and the error polynomial roots are used to derive the error values. In the prior 
20 art, this is typically accomplished using the Forney method. The present 

invention presents a different method of computing the error magnitudes that 
improves performance. 

[0027] The final block is the error correction block 28. After the error 
magnitudes block 2.6, the error polynomial e(x) is known completely and can be 
25 subtracted from the received polynomial r(x) to yield the codeword c(x). 
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[0028] It should be noted that Reed Solomon structures use Galois arithmetic. 
Some DSPs (digital signal processors) perform Galois operations directly, while 
others can be programmed to implement Galois arithmetic. 

[0029] The error magnitude block solves the equation: ^' ' ' where v 

is the number of errors, Z; are the error locations, are the error magnitudes, 

and El are the syndromes. This is a system of v linear equations with v 
unknowns - the values of the v coefficients at the known error locations. In 
matrix form, it can be described as: 



V' ... a"- 

■ • • 








• • * 






A. 



(1) 



[0030] The equation set forth above is the commonly used matrix equation for 
solving for e(x). The method set forth below provides an efficient way of solving 
the matrix equation, which can be particularly useful for software 
implementations using a DSP. 

[0031] For simplification, the notation Ax is used in place of ; hence the 
first matrix (the error polynomial coefficient matrix) in equation (1) can be 
rewritten as: 

Al' A2^ Av^ 

AV AT ... AV _ 

[0032] This vxv matrix is a Vandermonde matrix and can be efficiently 
triangularized by the methods discussed below. By triangularizing this matrix, 
the determination of the error magnitudes can be simplified. In this case, an 
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upper triangular matrix is shown, but a lower triangular matrix can be used as 
well. The triangularized matrix for v=A is: 

'Al Al A3 A4 

A2{A2 + A\) A3{A3 + A\) AA{AA + A\) 

A3{A3 + Al){A3 + A2) AA{AA + A\){AA + Al) 

A4{A4 + Al){A4 + A2){A4 + A3)_ 

[0033] The triangularization of the error polynomial coefficient matrix 
requires 1+2+3+.. .+(v-l) (Galois) multiplications and l+2+3+...+(v-l) (Galois) 
additions. To preserve the equation, the same operation are performed on the 
other side of equation (1); hence the syndrome vector is rewritten as: 

El 

El + Al'^El 
E3 + AI* E1^-A2*-{E1 + A\* E\) 
_£4 + ^l*£3 + ^2* (£3 + ^l*£2) + ^3(£3 + ^l*£2 + ^2*(£2 + ^l *£!))_ 

[0034] These computations can also be done using 1+2+3+ . . . +{v-l) 
multiplications and the same number of additions. 

[0035] Once triangularized matrixes are formed, the equations can be solved 
with relative ease. Solving at the last row (first row for a lower triangular 
matrix) of the error polynomial coefficient matrix yields an equation with one 
unknown, namely (for the 4x4 matrix example): 
A4{A4 + Al){A4 + A2){A4 + A3)e,^ = 

E4 + Al*E3 + A2*(E3 + Al*E2) + A3(E3 + Al*E2 + A2*(E2 + Al*El)). 
The equation is easily solved for : 

_ E4 + Al*E3 + A2*(E3 + Al*E2) + A3{E3 + Al*E2 + A2*{E2 + Al* El)) 
~ A4{A4 + Al)iA4 + A1){A4 + ^3) 
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[0036] The next row of the matrix equation can be solved for e/^^ , as there 
will be only one unlcnown, since the value of is now known. By back- 
substituting (using 1iie solutions to create additional equations with a single 
unknown), each error magnitude can be solved as an equation of a single 
unknown. The computation of all the error magnitudes would require v 
divisions, (v-l)+(v-2)+.,,'^l multiplications and the same number of additions. 
Hence, the number of operation in computing the error magnitudes using matrix 
techniques is proportional to v^, as it is in the Forney method. 

[0037] A more generahzed implementation of the triangularization 
techniques are now described. In the following discussion, R(Ax)2 indicates a 
replicate function, found on some DSPs as a native command that replicates a 
scalar across a vector z times. Hence R(A1)3 would yield [Al Al Al]. Further, 
the vector multiplication and addition is element-by-element; hence if 
V1=[A1 A2 A3]andV2=[A3 A2 Al], then V1*V2=[A1A3 A2A2 A3A1]. 
Likewise V1+V2=[A1+ A3 A2+A2 A3+A1]. 

[0038] The vxv error polynomial coefficient matrix can be triangularized 
using the formula Vn=(V(n-i; +R(A(n-2))t;-«+i)*V(n-l) with VI being the vector of 
the top row of the matrix and Vv being the vector of the bottom row of the 
matrix. 

VI -{Al A2 A3 ... Av} 
V2 = (V1+R(A1))*V1 
V3 = (V1+R(A2))*V2 
V4 = (V1+R(A3))*V3 

Vv = (Vl+R(AvVn+i)*V(v-l) 

[0039] Figure 3a illustrates a flowchart showing the computations for the 
triangularized error polynomial coefficient matrix. In step 30, the top row of the 
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error polynomial coefficient matrix is defined as {Al A2 A3 . . . Av). The index, 
n, is set to ''V. In block 32, the index n is incremented and in block 34, the next 
row of the error polynomial coefficient matrix is computed as Vn=(V(n-l) 
+R(A(n-l))y-n+2)*V(n-l). Steps 32 and 34 are repeated until the final row {n=v) is 
5 computed in decision block 36. As can be seen, the computation of the triangular 
matrix is structured such that the previous row (V(n-l)) is multiplied (element- 
by-element) by Vl+R(A(n-l), Accordingly, a minimum of calculations are 
performed to compute each successive row. Figure 3b illustrates a table showing 
the calculation of elements for a 3 x 3 error polynomial coefficient matrix. In the 

U 10 table an "X" represents a "don't care'' for elements of the triangular matrix that 

O 

p will be set to zero. 

B 

fU 

? T. X 

ffi 15 [0041] The operations performed on the syndrome vector can be described as 



20 



[0040] On a DSP, such as the TEXAS INSTRUMENTS C6XHP family of 

processors, computing each successive vector would take two instruction cycles. 
Triangularizing an 8 x 8 matrix, therefore, would take about 16 cycles. 



^ follows: 

H T0={1) 



Tl=R(Al)i*TO + T0«1, U1=T0*{E1}, W[l]=sum(Ul) 
T1=R(A2)2*T1 + Tl«l, U2=T1*{E2 El}, W[2]-sum(U2) 



T(n)=R(A(n))n*T(n-l) + T(n-l)«l, U(n)=T(n-l)*{E(n) E(n-1) ... El}, 
W[n]=sum(U(n)) 

where T(n-l)«l is the previous value of T, left-shifted and right-filled 
with a "0". In the addition of R(A(n))n*T(n-l) + T(n-l)«l, the value of 
25 R(A(n))n*T(n-l) is left-filled with a "0" so that the vectors are the same size. Once 
again, additions and multiplications between vectors are element-by-element. 
Sum(U(n)) is the sum of all elements of U(n). 
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[0042] Figure 5a illustrates a flow chart describing the recursive calculation of 
the syndrome vector. In step 40, the index n is set to "0"and an initial vector TO 
is defined. In step 42, the index is incremented and in step 44, T(n) is defined as 
T(n)=R(A(n))n*T(n-l) + T(n-l)«l. For Tl (i.e., n=l), this calculation would be 
equivalent to {A1}*{1}+{1 0}. Therefore, T1={1 Al}, In step 46, U(n) is 
calculated as U(n)=T(n-l)*{E(n) E(n-l) El}. Forn=l, U(n)={l}{El}={El), In 
step 48, W(n) is calculated as W(n)=sum(U(n)). For n=l, U(n)=El. The modified 
syndrome vector is set to: 

>r 

Wv 

[0043] Decision block 50 repeats steps 42 through 48, until all elements of the 
modified syndrome vector are calculated. As can be seen, each element is 
calculated from a previous T vectors in steps 44 and 46. 

[0044] For n=2, the new value of T is calculated as the previous value of T - 
{1 Al} - multiplied element-by-element with {A2 A2} to render {A2 A1A2}. 
This is added to the left-shifted T(n-l) - {1 Al 0} to render {1 A1+A2 A1A2}. 
U2 is calculated as \1 Al} multiplied by {E2 El} to render {E2 AlEl}. Hence, 
W2=E2+A1E1. Figures 4b and 4c illustrate the calculations for a 1 x 3 matrix. 

[0045] On a modern DSP, such as the C6XHP family of processors, computing 
each successive T and U vector takes two instruction cycles (two shifts, two 
XORs, four GF(256) multiplies, and two replicates). Computing W[2] would take 
a shift and an add, W[3] and W[4] would take two shifts and tv^^o adds each, and 
W[5-8] would take two shifts and three adds each. Thus for a 1x8 matrix, about 8 
instruction cycles would be required to compute the W vector from the U values. 
Including the time to compute the T and U vectors, about 24 instruction cycles 
would be required to compute the W vector. 
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[0046] Figure 5 illustrates a flow chart describing overall operation of the 
error magnitudes block 26. In step 60, the error polynomial coefficient matrix is 
triangularized, preferably using recursive calculations as shown in connection 
with Figure 3a. In block 62, the syndrome vector is modified pursuant to the 
changes made in the error polynomial coefficient matrix. 

[0047] In block 64, the error magnitudes are calculated by substitution. The 
kernel execution time for back-substitution would be about six instruction cycles 
per iteration. The operations would be: 

(I) load reciprocal of diagonal element of triangularized error polynomial 
coefficient matrix; 

(2-5) pack column of triangularized matrix into a vector; 

(6-7) multiply element of W vector by reciprocal (gives error magnitude); 

(8) replicate e^rror magnitude; 

(9-10) multiply colunrn vector by error magnitude; 

(II) add product vector to W vector. 

[0048] Another iteration of step 6 cannot be started until step 11 is complete; 
however, the loading of a reciprocal and packing of a column into a vector can be 
started prior to step 11 to conserve time. The execution time of back-substitution 
would be about f orlty-eight instruction cycles for an 8x8 error polynomial 
coefficient matrix. 

[0049] The worst-case execution time of finding the error magnitudes on a 
C6XHP or similar processor would be about 88 (16+24+48) instruction cycles. 
This could probably be reduced by specific coding for vectors that mostly contain 
zeroes, 

[0050] The present invention provides an alternative method for the Forney 
method of calculating error magnitudes. On modern DSPs, the present invention 
can provide increased speed in the calculation. Whereas the Forney method 
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operates on scalars, the present invention operates on vectors to take better 
advantage of hardware that computes several Galois field at a time. As 
processors are able to operate on larger Galois field vectors, the savings in 
execution time using the present invention increases. 

[0051] Although the Detailed Description of the invention has been directed 
to certain exemplary embodiments, various modifications of these embodiments, 
as well as alternative embodiments, will be suggested to those skilled in the art. 
The invention encompasses any modifications or altemative embodiments that 
fall within the scope of the Claims. 
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